A Kernel-Based Communication Fault Injector for Dependability Testing of Distributed Systems
Abstract
Software-implemented fault injection is a powerful strategy for testing fault-tolerant protocols in distributed environments. In this paper, we present ComFIRM, a communication fault injection tool that we developed to minimize the probe effect on the protocols under test. ComFIRM takes advantage of the ability to insert code directly into the Linux kernel, at the lowest level of the protocol stack, by loading kernel modules. The tool injects faults directly into the message exchange subsystem, so that test scenarios can be defined from a wide fault model that affects messages being sent, received, or both. The tool is also demonstrated in an experiment that applies the fault injector to evaluate the behavior of a group membership service under communication faults.
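To make the idea of a test scenario concrete, the following is a minimal user-space sketch in Python, not ComFIRM's kernel-level implementation. It assumes a scenario is a list of rules, each naming a direction (send, receive, or both), a message filter, and a fault action. The names `Rule`, `FaultInjector`, and `process`, the message prefixes, and the three actions (drop, duplicate, corrupt) are hypothetical choices made for illustration; the abstract does not enumerate the tool's fault model.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Rule:
    """One entry of a hypothetical test scenario."""
    direction: str                     # "send", "recv", or "both"
    matches: Callable[[bytes], bool]   # selects the messages this rule affects
    action: str                        # "drop", "duplicate", or "corrupt" (assumed fault types)


class FaultInjector:
    """Applies the first matching rule to each message passing through."""

    def __init__(self, rules: List[Rule]):
        self.rules = rules

    def process(self, direction: str, msg: bytes) -> List[bytes]:
        """Return the messages that continue through the stack (possibly none)."""
        for rule in self.rules:
            if rule.direction in (direction, "both") and rule.matches(msg):
                if rule.action == "drop":
                    return []
                if rule.action == "duplicate":
                    return [msg, msg]
                if rule.action == "corrupt":
                    # Flip the lowest bit of the first byte; leave empty messages alone.
                    return [bytes([msg[0] ^ 0x01]) + msg[1:]] if msg else [msg]
        # No rule matched: the message passes through unchanged.
        return [msg]


# Example scenario: lose outgoing heartbeats, duplicate incoming join requests.
scenario = FaultInjector([
    Rule("send", lambda m: m.startswith(b"HEARTBEAT"), "drop"),
    Rule("recv", lambda m: m.startswith(b"JOIN"), "duplicate"),
])

print(scenario.process("send", b"HEARTBEAT 1"))
print(scenario.process("recv", b"JOIN node3"))
print(scenario.process("send", b"DATA x"))
```

In a kernel-based injector the equivalent of `process` would run inside the protocol stack, so the protocol under test needs no modification; this sketch only illustrates how direction-specific rules select and alter messages.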
Similar resources
NFTAPE: A Framework for Assessing Dependability in Distributed Systems with Lightweight Fault Injectors
Many fault injection tools are available for dependability assessment. Although these tools are good at injecting a single fault model into a single system, they suffer from two main limitations for use in distributed systems: (1) no single tool is sufficient for injecting all necessary fault models; (2) it is difficult to port these tools to new systems. NFTAPE, a tool for composing automated ...
Experimental Dependability Evaluation of Memory Manager in the Real-time Operating System
The paper presents the results of an experimental dependability evaluation of the PhoenixRTOS operating system. Experiments are conducted using a self-developed testing environment and a kernel fault injector. Dependability evaluation is the last stage of a system development process. The results will be used in future research to propose a dependable memory manager.
Understanding Communication Faults in Parallel Computers
This paper addresses the evaluation of the dependability properties of distributed memory parallel systems through fault injection. The most popular parallel computers are based on the distributed memory architecture where loosely coupled processors communicate by message-passing. Fault tolerance is an issue which increasingly concerns manufacturers and end users of these systems as the probabi...
Automated Fault-Inject Based Dependability Analysis of Distributed Computer Systems
Recently, there has been interest in developing dependability benchmarks for computer systems. This will require a way to inject several different types of faults into many different platforms and a way to collect and compare the results. Analyzing complex heterogeneous distributed systems shares the same needs. The current approach to building fault injection tools is inappropriate for these g...
Assessing the Dependability of OGSA Middleware by Fault Injection
This paper presents our research on devising a dependability assessment method for the upcoming OGSA 3.0 middleware using network level fault injection. We compare existing DCE middleware dependability testing research with the requirements of testing OGSA middleware and derive a new method and fault model. From this we have implemented an extendable fault injector framework and undertaken some...
Publication date: 2005